Search CORE

1 research outputs found

WEB SCALE INFORMATION EXTRACTION USING WRAPPER INDUCTION APPROACH

Author: GADGE JAYANT
ZAMBAD RINA
Publication venue: Institute for Project Management Pvt. Ltd
Publication date: 08/09/2020
Field of study

Information extraction from unstructured, ungrammatical data such as classified listings is difficult because traditional structural and grammatical extraction methods do not apply. The proposed architecture extracts unstructured and un-grammatical data using wrapper induction and show the result in structured format. The source of data will be collected from various post website. The obtained post data pages are processed by page parsing, cleansing and data extraction to obtain new reference sets. Reference sets are used for mapping the user search query, which improvised the scale of search on unstructured and ungrammatical post data. We validate our approach with experimental results

Interscience Research Network